Enlarging multiword expression dataset by co-training
Similar resources
Multiword Expressions Dataset for Indian Languages
Multiword Expressions (MWEs) are used frequently in natural languages, but understanding the diversity in MWEs is one of the open problems in the area of Natural Language Processing. In the context of Indian languages, MWEs play an important role. In this paper, we present an MWE annotation dataset created for Indian languages, viz. Hindi and Marathi. We extract possible MWE candidates using two r...
Inference Improvement by Enlarging the Training Set While Learning DFAs
A new version of the RPNI algorithm, called RPNI2, is presented. The main difference is that the new version can extend the training set during the inference process. The effect of this new feature is especially noticeable in the inference of languages generated from regular expressions and Non-deterministic Finite Automata (NFA). A first experimental comparison is done betwe...
Enlarging Training Sets for Neural Networks
A study is presented to compare the performance of multilayer perceptrons, radial basis function networks, and probabilistic neural networks for classification. In many classification problems, probabilistic neural networks have outperformed other neural classifiers. Unfortunately, with this kind of network, the number of required operations to classify one pattern directly depends on the numb...
Multiword Expression Recognition
In the recent past, the important role played by multiword expressions in the language has been recognized by the natural language processing community. Simply put, a multiword expression (MWE) is a word collocation that exhibits markedly peculiar linguistic behaviour in terms of lexicalization, syntax or semantics. Among others, ubiquitous compound nouns, idioms and phrasal verbs fall into thi...
Semantics-based Multiword Expression Extraction
This paper describes a fully unsupervised and automated method for large-scale extraction of multiword expressions (MWEs) from large corpora. The method aims at capturing the non-compositionality of MWEs; the intuition is that a noun within an MWE cannot easily be replaced by a semantically similar noun. To implement this intuition, a noun clustering is automatically extracted (using distributio...
Journal
Journal title: TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES
Year: 2018
ISSN: 1300-0632,1303-6203
DOI: 10.3906/elk-1709-185